GWAS Strategy
GWAS was run using MLM model in GCTA1.93.2. Note that I tried different strategies to directly fit covariates or pre-adjust phenotypes by the covariates. The beta correlation between different strategies will be shown for Manhattan Plots as below.
Take SRS_RMB_sum in Probands (with FSIQ included as covariate) for example, I tried:
- Strategy 1: Phenotype pre-adjusted by age, sex, chip, FSIQ
- Strategy 2: Phenotype pre-adjusted by age, sex, chip, FSIQ and 20 PCs
- Strategy 3: directly fit age, sex, chip, FSIQ
- Strategy 4: directly fit age, sex, chip, FSIQ and 20 PCs
Note that considering the GWAS sample size, computational time and false positive rates, we will report the results below:
- based on Strategy2 for the same phenotype
- based on sum measurement for the same phenotype
Probands
All Individuals
- GWAS was run on all individuals including diverse ancestry backgrounds.
- Signals with association p-value < 1e-5 will be shown for Manhattan Plots.
QQ Plot
Primary Variable
Fitting FSIQ

Not fitting FSIQ

Secondary Variable
Fitting FSIQ

Not fitting FSIQ

Manhattan Plot
Primary Variable
Fitting FSIQ

Not fitting FSIQ

Secondary Variable
Fitting FSIQ

Not fitting FSIQ

Europeans Only
- 6861972 QCd SNPs with MAF > 0.01 included
- 1946 European individuals are included
- Signals with association p-value < 1e-5 will be shown for Manhattan Plots.
QQ Plot
Primary Variable
Fitting FSIQ

Not fitting FSIQ

Secondary Variable
Fitting FSIQ

Not fitting FSIQ

Manhattan Plot
Primary Variable
Fitting FSIQ

Not fitting FSIQ

Secondary Variable
Fitting FSIQ

Not fitting FSIQ

Probands & Unaffected Siblings
- Combined phenotypes for Probands and Unaffected Siblings are pre-adjusted by covariates (20 PCs, sex, chip and ASD diagnosis) and then RINT.
- Based on the phenotype distribution, we run GWAS on combined data.
Note that there is no available information for FSIQ and Age for Unaffected Siblings, thus FSIQ and Age will not be included as covariates for the combined data analysis.
All Individuals
- GWAS was run on all individuals including diverse ancestry backgrounds.
- Signals with association p-value < 1e-5 will be shown for Manhattan Plots.
QQ Plot
Primary Variable

Secondary Variable

Manhattan Plot
Primary Variable

Secondary Variable

Europeans Only
- GWAS was run on European individuals with N=3544.
- Signals with association p-value < 1e-5 will be shown for Manhattan Plots for Manhattan Plots.
QQ Plot
Primary Variable

Secondary Variable

Manhattan Plot
Primary Variable

Secondary Variable

Heritability Estimation
Probands
- Unrelated Individuals (GRM < 0.05) were used for performing GREML analysis.
- Using different models based on phenotypes:
- Est.1-1: pre-adjust by Age, Sex and Chip
- Est.1-2: directly fit covariates: Age, Sex and Chip
- Est.2-1: pre-adjust by Age, Sex, Chip and 20PCs
- Est.2-2: directly fit covariates: Age, Sex, Chip and 20PCs
- Est.3-1: pre-adjust by Age, Sex, Chip and FSIQ
- Est.3-2: directly fit covariates: Age, Sex, Chip and FSIQ
- Est.4-1: pre-adjust by Age, Sex, Chip, FSIQ and 20PCs
- Est.4-2: directly fit covariates: Age, Sex, Chip, FSIQ and 20PCs
Primary Variable

Secondary Variable

Probands & Unaffected Siblings
- Unrelated Individuals (GRM < 0.05) were used for performing GREML analysis.
- Using different models based on phenotypes:
- Est.1-1: pre-adjust by ASD diagnosis, Sex and Chip
- Est.1-2: directly fit covariates: ASD diagnosis, Sex and Chip
- Est.2-1: pre-adjust by ASD diagnosis, Sex, Chip and 20PCs
- Est.2-2: directly fit covariates: ASD diagnosis, Sex, Chip and 20PCs
Primary Variable

Secondary Variable

Genetic Correlation
Probands
RRBs
RRBs VS. Public
Probands & Unaffected Siblings
RRBs
RRBs VS. Public
[Problems for REML, unrelated EURs for combined data is only ~2K, similar magnitude as probands.]